Accession Number |
TCMCG021C38728 |
gbkey |
CDS |
Protein Id |
XP_029119768.1 |
Location |
join(27628001..27628033,27628174..27628242,27628332..27628664,27628774..27628977,27630152..27630228,27630413..27630482,27633459..27633511,27633867..27633924,27634010..27634395,27634511..27634583,27635293..27635391,27635469..27635537,27635629..27635706,27635880..27635966,27636048..27636082,27636269..27636374,27637475..27637555,27637648..27637740,27637822..27637875,27637958..27638074,27638172..27638270,27642937..27642984,27643075..27643167,27644710..27644817,27644948..27645012,27645096..27645198,27647721..27648041,27648152..27648286) |
Gene |
LOC105042956 |
GeneID |
105042956 |
Organism |
Elaeis guineensis |
|
|
Length |
1048aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA268357 |
db_source |
XM_029263935.1 |
Definition |
pre-mRNA-processing protein 40A isoform X1 [Elaeis guineensis] |
CDS: ATGGCCAGCAATCCACAGTCTTCTGGGGCACAGCCCCTTAGGCCTGCAGTGGTGGGCTCCACAGGTCCACCTCAAAATTTTGGTCCACCAATGTCTATGCAGTTCAGACCTGTGGTTCCGGCTCAGCAAACACATCAGTTCATCCCAGCCGCTTCTCAGCAATTCCGACCTGTTGGGCAGGGTGTTCCTGGTTCAAATATTGGAATACCTTCTGGTCAAACTCAAATGCACCATTTTCCTCAAGCTGCACAGCATTTGCCTCCAAGATCAGGTCCACCAGCACCACCTTCTTCACAAGCAATTCCAATGCCATATGTTCAGCCAAGTAGGCCTATGACATCTGGGTCATTGCAGCCCCAGCAAAATCCTCAGATGCCAAACACTCATCTGCCTAATTTAGGTGGCATAGGAATGCCTCTTTCTTCATCTTATACGTTTGCAACGTCCTATGGCCAGCCACCAAATACTGTTAATACACCATCCCAGTATCAACCAGCATCCCAGATGCAACCACCCATTACACCTGCAGTGGGACAACCATGGTCTGCACCTGGGACTCAGAATGTACCTCTTGTTACACCTCTGGTTCAGGCTGCACCACAATCTTCAGCTACGGGGGCTGCGGGGGGCACACTTGCTACAACAAATGCACAACCTAGGTCTTCTGAACAGACCTCTTCTGATTGGCAAGAGCACACATCTGCTGATGGAAAAAGATATTATTACAATAAGAAGACTAGACAATCTAGCTGGGAGAAGCCCTTTGAGTTGATGACACCTATTGAGAGGGCTGATGCCTCGACTGATTGGAAGGAGTTCAGCACTTCGGATGGACGAACATACTATTATAACAAGGTCACAAAGATATCAAAATGGATAATTCCTGATGAGCTCAAGTTAGCTCGTGAGCAAGCATTAAGGGCTGCAGCGCAGCAGGCACATACAGAAACTGGAACAGCTTCTGTTGCCCCAGTTGCTCCAACTGTTTCTTCTGGGGAGATCCCTTCCACCACAACAACCTCTGTGATTCCTGGAACAACATCTGCAGTAGCGTCAAGCTTTGTTCCAGTGGCAGTCAACTCAGTTATATCATCATCTGCGATGGCTTCGGTATCACCATCCGTTATACCATCACCCGGAATCTTGTCGAGTTCCTCCTTGGGAATTCCAAGCACAGTTGGCTCATCAAATCTTCAGACAACCATAACGCCATTACCCGCACCTATTTCTGCTAACACGGTGGTGTCTACTGTTGCAACAGAATCTGCATCATCTAACACTAAGAGTAATCATGACAGTTCATCTCTCCTAAATACTGCAAGTGTGCCTGATGGTGCTTCTGCTCAGGATTTGGAGGAAGCTAAAAAGGCTATGCCAGTTACTGGAAAAGTCAATGTCACTCCAATGGAGGAGAAAACAATTGATGAAGAACCTTTGGTTTATTCTAATAAGCAGGAGGCAAAAGCTGCATTCAAAGCCCTACTGGAATCTGCAAATGTTGAATCTGATTGGACTTGGGAGCAGGCAATGAGAGTTATTATTAATGACAAGAGATATGGTGCATTAAAGACGCTTGGTGAGAGGAAACAAGCATTTAATGAGTACTTGGGACAAAGGAAAAAACAGGAAGCTGAGGAAAGGCGCATTAAACAAAAGAAAGCACGGGAAGACTTCACCAGAATGTTGGAGGAATGCAAAGAATTAACTTCGTCAACCCGGTGGAGTAAAGCTGGGACCATGTTTGAGAATGATGAACGGTTTCATGCTGTTGAGCGGCCAAGAGATCGTGAGGATCTATTTGAAAGCTATTTGGTGGATCTCCAGAAAAAGGAAAGAGCAAAGGCTGCTGAGGAGCATAAGCGAAACATAATGGAGTATAGAGCTTTTCTTGAATCATGTGACTTTATCAAGGCAAACACCCAATGGAGGAAAGTTCAAGATCGACTGGAGGATGATGAAAGATGTTCTCGACTTGAAAAAATTGATCGTTTGGAG
ATCTTCCAGGAGTATATACATGATCTGGAGAAGGAAGAGGAGGAGCAGCGGAAGATACAGAAGGAGCAATTAAGACGGCAGGAACGCAAAAACCGTGATGAGTTTCGCAAATTAATGGAAGAACATGTTGCTGCTGGAATTCTTACAGCTAAAACTCACTGGCGTGATTACTGTATGCAGGTGAAGGATTTGCCTTCTTACATAGCAGTATCATCAAACACATCCGGTTCAACACCAAAAGATCTGTTTGAGGATGTTGCAGAGGAACTCGAAAAGCAGTACCATGAGGACAAGGCCCAGATCAAAGAAGCAATTAAGATCCGAAAGATTGCTTTGGCATCTTCATGGACATTTGAAGATTTCAAGGCTGCCACTTTGGGTGATGATAGCCTAAAAGGAATTTCTGAGACAAACATGAAGCTTGTTTTTGATGAGTTACTGGAAAGGCTTAGGGAGAAAGAGGAGAAGGAGGCTAAGAAGCGCCAACGTCTTGCAGATAATTTCTCTGACCTGCTTTACTCAATTAAGGAAATAACTGCTTCATCTAGATGGGAGGACTGCAAGTCACTGTTTGAAGACAGCCAAGAGTACAGGTCAATTGGTGATGAGAATGTTGTAAGAGAGATTTTTGAAGAACATGTAACCCGCTTGCAAGAAAAGCTGAAAGAGAAAGAGCGTAAAAGAGAAGAGGAAAAGGCAAAGAAAGAGAAAGAAAGAGAGGAGAAAGAGAAGAGGAAGGAGAAAGAAAGGAAAGAAAAGGAAGAAAGGAAGGAAAAAGAGAGAGAACGCGAAAAAGACAAAGGGAAGGACCGATCTAGAAAGGATGAAGCAGAAAGTGAAAATGTTGATGTGATGGACAGCCATGGCTCGAAGGATAGAAAGAGGGAAAGGGATAAGGAAAGAAAGCATCGGAAACGTCACCACAGCATGGCTGATGATGTAAGCTCTGAAAAAGATGATAAAGAGGAGTCCAAGAAGTCCCGCAGGCATAGCAGTGACCGGAAAAAATCACGTAAGCATGCTTACACTACCGATTCAGATAGTGAAAATCGACACAAGAGGCATAAGAAAGATCGAGATGGGTCCCGTAGAAATGGTGGTTATGAAGAGCTTGAGGATGGGGAACTTGGAGAGGATGGGGAAATACGTTAG |
Protein: MASNPQSSGAQPLRPAVVGSTGPPQNFGPPMSMQFRPVVPAQQTHQFIPAASQQFRPVGQGVPGSNIGIPSGQTQMHHFPQAAQHLPPRSGPPAPPSSQAIPMPYVQPSRPMTSGSLQPQQNPQMPNTHLPNLGGIGMPLSSSYTFATSYGQPPNTVNTPSQYQPASQMQPPITPAVGQPWSAPGTQNVPLVTPLVQAAPQSSATGAAGGTLATTNAQPRSSEQTSSDWQEHTSADGKRYYYNKKTRQSSWEKPFELMTPIERADASTDWKEFSTSDGRTYYYNKVTKISKWIIPDELKLAREQALRAAAQQAHTETGTASVAPVAPTVSSGEIPSTTTTSVIPGTTSAVASSFVPVAVNSVISSSAMASVSPSVIPSPGILSSSSLGIPSTVGSSNLQTTITPLPAPISANTVVSTVATESASSNTKSNHDSSSLLNTASVPDGASAQDLEEAKKAMPVTGKVNVTPMEEKTIDEEPLVYSNKQEAKAAFKALLESANVESDWTWEQAMRVIINDKRYGALKTLGERKQAFNEYLGQRKKQEAEERRIKQKKAREDFTRMLEECKELTSSTRWSKAGTMFENDERFHAVERPRDREDLFESYLVDLQKKERAKAAEEHKRNIMEYRAFLESCDFIKANTQWRKVQDRLEDDERCSRLEKIDRLEIFQEYIHDLEKEEEEQRKIQKEQLRRQERKNRDEFRKLMEEHVAAGILTAKTHWRDYCMQVKDLPSYIAVSSNTSGSTPKDLFEDVAEELEKQYHEDKAQIKEAIKIRKIALASSWTFEDFKAATLGDDSLKGISETNMKLVFDELLERLREKEEKEAKKRQRLADNFSDLLYSIKEITASSRWEDCKSLFEDSQEYRSIGDENVVREIFEEHVTRLQEKLKEKERKREEEKAKKEKEREEKEKRKEKERKEKEERKEKEREREKDKGKDRSRKDEAESENVDVMDSHGSKDRKRERDKERKHRKRHHSMADDVSSEKDDKEESKKSRRHSSDRKKSRKHAYTTDSDSENRHKRHKKDRDGSRRNGGYEELEDGELGEDGEIR |
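The Location, CDS, Protein, and Length fields above can be cross-checked against each other. The following is a minimal sketch of such a check, assuming the Location value is a GenBank-style `join(start..end,...)` string with 1-based inclusive coordinates and that the CDS uses the standard genetic code. The function names (`parse_join`, `spliced_length`, `translate`) are hypothetical helpers written for illustration and are not part of any database tooling.

```python
import re
from itertools import product


def parse_join(location):
    """Return (start, end) integer pairs from a join(a..b,c..d,...) string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(location):
    """Total nucleotides covered by all intervals (coordinates are inclusive)."""
    return sum(end - start + 1 for start, end in parse_join(location))


# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the record uses the standard code (translation table 1).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds):
    """Translate a CDS up to (not including) the first stop codon.

    Whitespace and line breaks inside the sequence are ignored, so a CDS
    that was wrapped across lines can be passed in as-is. Codons that are
    not in the table (for example, ones containing N) become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First two exons and first six codons of the record, as a small demo.
    print(spliced_length("join(27628001..27628033,27628174..27628242)"))
    print(translate("ATGGCCAGCAATCCACAG"))
```

Applied to the full Location value of this record, the 28 intervals sum to 3147 nucleotides, which equals 3 × (1048 + 1) and so agrees with the stated Length of 1048aa plus one stop codon. Passing the complete CDS string to `translate` and comparing the result with the Protein field is a further consistency check.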